Structural group-based auditing of missing hierarchical relationships in UMLS

Authors

  • Yan Chen
  • Huanying Gu
  • Yehoshua Perl
  • James Geller

Abstract

The Metathesaurus of the UMLS was created by integrating various source terminologies. The inter-concept relationships were either integrated into the UMLS from the source terminologies or specially generated. Due to the extensive size and inherent complexity of the Metathesaurus, the accidental omission of some hierarchical relationships was inevitable. We present a recursive procedure which allows a human expert, with the support of an algorithm, to locate missing hierarchical relationships. The procedure starts with a group of concepts with exactly the same (correct) semantic type assignments. It then partitions the concepts, based on child-of hierarchical relationships, into smaller, singly rooted, hierarchically connected subgroups. The auditor only needs to focus on the subgroups with very few concepts and their concepts with semantic type reassignments. The procedure was evaluated by comparing it with a comprehensive manual audit and it exhibits a perfect error recall.
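
The partitioning step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `(child, parent)` input format, the concept identifiers, and the `max_size` threshold are all assumptions made for illustration. The recursion and the expert review are omitted. A root is taken here to be a concept with no parent inside the group.

```python
from collections import defaultdict


def partition_into_rooted_subgroups(concepts, child_of):
    """Split a group of concepts into singly rooted, connected subgroups.

    concepts: identifiers of concepts sharing the same semantic type assignment.
    child_of: (child, parent) pairs; pairs that leave the group are ignored.
    Returns a dict mapping each root to the set of concepts reachable from it.
    Assumption: a concept with parents under two roots appears in both subgroups.
    Note: concepts that sit only on a cycle have no root and are not returned.
    """
    group = set(concepts)
    children = defaultdict(set)
    has_parent_in_group = set()
    for child, parent in child_of:
        if child in group and parent in group and child != parent:
            children[parent].add(child)
            has_parent_in_group.add(child)

    # Roots are concepts with no parent inside the group.
    roots = group - has_parent_in_group

    subgroups = {}
    for root in sorted(roots):
        seen = {root}
        stack = [root]
        while stack:  # depth-first walk down the child-of links
            node = stack.pop()
            for child in children.get(node, ()):
                if child not in seen:
                    seen.add(child)
                    stack.append(child)
        subgroups[root] = seen
    return subgroups


def small_subgroups(subgroups, max_size=2):
    """Return the subgroups small enough to hand to a human auditor.

    max_size is a hypothetical threshold, not a value taken from the paper.
    """
    return {r: m for r, m in subgroups.items() if len(m) <= max_size}


# Hypothetical identifiers; "C999" lies outside the group.
example_group = ["C001", "C002", "C003", "C004"]
example_links = [("C002", "C001"), ("C003", "C002"), ("C004", "C999")]
example_subgroups = partition_into_rooted_subgroups(example_group, example_links)
review_candidates = small_subgroups(example_subgroups, max_size=1)
```

In this toy input, `C004` ends up alone in its own subgroup because its only parent lies outside the group. That is the kind of small subgroup the abstract says the auditor should inspect for a missing hierarchical relationship.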

Similar resources

Auditing hierarchical cycles to locate other inconsistencies in the UMLS.

A cycle in the parent relationship hierarchy of the UMLS is a configuration that effectively makes some concept(s) an ancestor of itself. Such a structural inconsistency can easily be found automatically. A previous strategy for disconnecting cycles is to break them with the deletion of one or more parent relationships, irrespective of the correctness of the deleted relationships. A methodology ...

Abstraction, Extension and Structural Auditing with the UMLS Semantic Network

Analyzing polysemous concepts from a clinical perspective: Application to auditing concept categorization in the UMLS

OBJECTIVES Polysemy is a frequent issue in biomedical terminologies. In the Unified Medical Language System (UMLS), polysemous terms are either represented as several independent concepts, or clustered into a single, multiply-categorized concept. The objective of this study is to analyze polysemous concepts in the UMLS through their categorization and hierarchical relations for auditing purpose...

The cohesive metaschema: a higher-level abstraction of the UMLS Semantic Network

The Unified Medical Language System (UMLS) joins together a group of established medical terminologies in a unified knowledge representation framework. Two major resources of the UMLS are its Metathesaurus, containing a large number of concepts, and the Semantic Network (SN), containing semantic types and forming an abstraction of the Metathesaurus. However, the SN itself is large and complex a...

Circular hierarchical relationships in the UMLS: etiology, diagnosis, treatment, complications and prevention

The Unified Medical Language System (UMLS) is a large repository of some 800,000 concepts for the biomedical domain, organized by several millions of inter-concept relationships, either inherited from the source vocabularies, or specifically generated. This paper focuses on hierarchical relationships in the UMLS Metathesaurus, and especially, on circular hierarchical relationships. Using the me...

Journal title:
  • Journal of biomedical informatics

Volume 42, Issue 3

Pages: -

Publication date: 2009